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ABSTRACT 


This  article  analyses  four  basic  unstructured  models  in 
program  code  as  well  as — during  decompile  controling  flow 
analysis — analyzing  the  processing  of  these  four  types  of 
unstructured  models.  This  guarantees  that  high  level  program 
readability  coming  out  of  decompilation  is  good  and  maintains 
source  program  structure.  This  article  also  introduces 
translation  to  structured  of  main  unstructured  models  as  well  as 
restoration  of  the  C  language  characteristics 
Continue  and  using  GoTo  statements  to  handle  a  number  of 
unstructured  situations . 

key  words  C  Decompile  unstructured  Models  Controling  Flow 


Decompilation  is  a  type  of  tool  for  translating  low  level 
code  to  high  level  code.  As  far  as  carrying  out  analysis  of 
controling  flow  associated  with  low  level  code  is  concerned,  it 
restores  various  types  of  control  structures.  For  example,  if- 
else  structures,  cyclical  structures.  Switch  structures,  and  so 
on,  are  one  key  part  in  decompilation. 

LOW  level  codes-for  example,  compilation  codes-are  most 
certainly  not  all  structured.  The  primary  reason  is  that,  in 
source  programs,  programmers  utilite  Goto  statements  for  their 
creation.  Moreover,  as  far  as  unstructured  factors  contained  in 
code  to  carry  out  targets  produced  after  going  through 
compilation  are  concerned,  besides  Goto  statements  utilised  by 
source  programs,  they  are  also  given  rise  to  by  break  and 
Continue  statements  in  C  language.  If  appropriate  processing  is 
not  carried  out  on  these  unstructured  factors  in  target  code. 


*  Numbers  in  margins  indicate  foreign  pagination. 
Commas  in  numbers  indicate  decimals. 
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then,  during  decompilation,  it  is  very  possible  that  the  quality 
of  translation  results  will  be  very  bad.  Because  of  this,  as  far 
as  handling  of  unstructured  problems  in  program  flow  graphs  is 
concerned,  it  is  very  important  in  control  flow  analysis. 

Processing  of  unstructured  code  includes  two  areas:  the 
first,  eliminating  unstructured  factors  in  program  code;  the 
second,  maintaining  as  much  as  possible  the  structure  of  the 
source  program.  For  example,  break.  Continue,  and  a  number  of 
necessary  GoTo  statements,  etc. 


1  PROGRAM  CODE  STRUCTURED  TRANSLATION 

Eliminating  unstructured  factors  in  program  code  then 
requires  the  carrying  out  of  structured  translations  on  them.  ^ 

According  to  the  definitions  of  William  MH.,  there  are 
three  types  of  basic  structural  flow  graphs.  They  are, 
respectively,  simple  sequence,  if-then-else,  and  while.  These 
basic  flow  graphs  are  inserted  into  sets  and  build  up  into 
complex  flow  graphs.  If  one  individual  program  graph  is 
completely  composed  of  these  three  types  of  basic  structural  f  ow 
charts,  then  the  program  graph  is  called  structured.  These  three 
types  of  basic  structured  flow  graph  are  as  shown  in  Fig.l. 


Key:(l)  Simple  Sequence  (2) 


Fig.l 

If-Then-Else  (3)  While 
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On  the  basis  of  the  analytic  induction  of  Oulsum  G.,  there 
are  four  types  of  basic  unstructured  forms.  They  are  ID,  OD,  IL, 
and  OL  as  shown  in  Fig. 2. 


ID  ■  OD  IL  OL 


Fig.  2 


With  regard  to  the  realization  of  translations  from 
unstructured  to  structured,  it  is  only  necessary  to  consider  any 
three  types  from  among  the  four  types  above.  The  reason  is  that 
any  equivalent  translation  of  unstructured  models  to  structured 
models  is  in  no  case  capable  of  occurring  independently.  The 
translation  necessarily  gives  rise  to  a  series  of  flow  graph 
model  translations.  For  example,  processes  for  the  elimination 
of  the  three  types  of  ID,  OD,  and  IL  unstructured  models  in  flow 
graphs  on  the  basis  of  a  fixed  sequence  will  then  cause  OL 
unstructured  models  existing  in  source  flow  graphs  to  also  be 
automatically  eliminated.  Therefore,  we  only  need  to  consider 
equivalent  translations  associated  with  these  three  types  of 
basic  ID,  OD,  and  IL  unstructured  models.  These  three  types  of 
translations  are  introduced  below. 

1.1  ID  Structured  Translations 


ID 


Fig.  3 


U  ■  C 


\ 


n 


F 


C 
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In  this,  q  stands  for  a  running  direction  (execute  e)  when 
decision  point  A  satisfies  condition  q.  When  condition  q 
is  satisfied,  after  executing  d,  point  B  is  arrived  at.  In  the 
same  way,  b  stands  for  operations  executed  during  the  process  of 
taking  nodal  point  E  to  B. 

On  the  basis  of  Bambridge  rules.  Fig. 3  translates  to  become 
Fig.  4. 


Fig.  4  I  b-i 

B  — - B 

1.2  OD  Structured  Translation 


c 


Fig.  5 


p.q  are  the  conditions  for  decision  points  A  and  B.  b,  c, 
d,  and  e  are  operations.  introduce  the  new  variable  p.  Add  new 
operation  P=0.  P:=p  and,  taking  the  original  point  B  shifts 
associated  with  p  conditions,  change  them  into  shifts  with 
regard  to  P.  Also,  add  turning  point  B* shifts  with  regard  to 

condition  P. 
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1.3  IL  Structured  Translation 


Fig.  6 

The  meanings  of  symbols  in  this  are  the  same  as  above. 
Structural  translation  seen  in  terms  of  equivalency  is  no 
problem.  The  reason  is  that  each  step  of  translation  is 
mathematically  equivalent.  The  entire  (illegible)  graph  also 
maintains  equivalency. 


2  UNSTRUCTURED  MODEL  PROCESSING 

With  MicroVAXII/VMS  operating  system,  we  realized  C 
decompilation  systems.  Processing  principles  for  unstructured 
codes  in  program  code  are;  eliminate  the  main  unstructured 
factors,  and,  as  much  as  possible,  maintain  original  program  code 
structures,  strengthening  easy  readability.  Below,  unstructured 
code  processing  based  on  program  graphs  is  introduced. 

2.1  Unstructured  Code  Processing  Associated  with  ID 

Due  to  the  fact  that  our  C  decompilation  system  carries  out 
control  flow  analysis  based  on  program  graph  methods,  reference 
was  made  to  the  whole  article  "Decompilation  Program  Graph  Design 
and  Control  Flow  Analysis".  Therefore,  it  is  possible  not  to 
consider  it  for  this  unstructured  type.  The  reason  is  that,  as 
far  as  this  type  of  unstructured  model  is  concerned,  in  program 
graphs,  automatic  elimination  is  possible. 
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2.2  Unstructured  Processing  Associated  with  OD 


With  regard  to  unstructured  jump  outs  from  decisions,  in 
programs  in  general,  there  may  still  be  some.  Programmers  are 
able  to  use  GoTo  statements  stemming  from  certain  types  of 
special  causes.  As  a  result,  during  processing,  we  also  did  not 
do  structured  translations,  but  used  GoTo  statements  for 
processing.  Fig. 7  then  shows  program  flow  after  processing. 


In  the  case  of  the  newly  added  junction  points  xl,  x2  in 
flow  diagrams,  if  statements  and  GoTo  statements  are  used 
together  to  handle  unstructured,  and  operation  b  taken  to  point 
B  is  separated  out  from  the  flow  graph.  As  far  as  flow  graphs 
from  E'  to  E  are  concerned,  unstructured  code  processing  is 
handled  in  the  same  way. 

2.3  Unstructured  Processing  Associated  with  L 

Treatment  of  unstructured  code  which  jumps  out  of  loops  is 
divided  into  three  situations  for  processing:  (1)  using  break 
statements  for  processing,  (2)  using  Continue  statements  for 
processing,  and  (3)  using  GoTo  statements  for  processing. 

With  break  statements,  it  is  possible  to  eliminate  numerous 
loop  outlets.  The  same  as  2.2,  use  is  also  made  of  GoTo 
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statement  processing.  Respective  introductions  are  made  below. 
Loops  use  dowhile  forms  as  examples,  while  forms  are  similar  as 
is  shown  in  Fig. 8. 


Fig.  8 


loops .  ( 1 )  Restoring  break  statement  and  eliminating 

unstructured  elements.  (2)  Restoring  Continue  statement  and 
eliminating  unstructured  elements.  (3)  Using  GoTo  statements  to 
process  unstructured  code. 

(1)  use  break  statements  to  eliminate  unstructured  code  (Fig. 9) 


Fig.  9 
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(2)  Use  Continue  statements  to  eliminate  overlapping  loops 
(Fig. 10) 


Fig. 10 


/41 


(3)  Use  Goto  statements  to  process  unstructured  code  (Fig. 11) 


2.4  Unstructured  Processing  Associated  with  IL 

Structured  transformations  must  be  carried  out  with  regard 
to  unstructured  codes  jumping  into  loops .  Because  using  GoTo 
statements  to  jump  into  loops  is  very  rarely  seen,  programmers 
generally  do  not  use  them.  Moreover,  some  high  level  languages 
basically  do  not  permit  this  type  of  situation  to  appear. 

Assume  Q  is  a  logic  variable.  Below  is  a  display  of  IL 
structured  translation  as  shown  in  Fig. 12. 

Add  the  two  expressions  Q:=l,  Q:=0. 
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Point  A  adds  conditions  to  a  decision. 
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